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The present invention provides a method in vitro for the early diagnose 
ptog nosis and monitoring of cancer, or for the determination of the risk of 
development of cancer. The method of the invention is hased on the 
quantification of DNA in a plasma sample through PGR amplification. 

5 BACKGROUND ART 

A number of scientific or patent publications describe methods for 
detecting cancer based on the identification of specific genetic alterations of 

circulating DNA or RNA. 

US 5 496 699 discloses a method for detecting mutations in nucleic acid 
10 sequences, in particular the sequence of the K-ras gene, in biological fluids 

such as blood, serum or plasma. 

US 5 068 175 discloses a method for detecting the presence of ras 
pncogene related malignancies in which said gene is quantified in serum or 
plasma samples. 

„ WO01/42504 discloses the determination of extracellular nucleic ac.d, 

for example DNA of K-ras and APC genes, in serum or plasma samples, for 
the evamation of the risk factor related with a number of neo pl astic diseases. 

WO02/18652 discloses a method of qua.i/quantitative detection of 
human telomere RNA and telomere reverse tra^criptase RNA in plasma 

20 or serum for the diagnosis, monitoring, treatment or evaluation of a number of 

neoplastic diseases. 

In principle, the methods for the molecular characterization of gene 
aherations have low sensitivity and require the analysis of a large panel of 
gene markers to obtain an acceptable information level. 
25 Recently, a method has been reported for fl»e quantitation of circulatmg 

naked DNA in plasma from lung cancer patients, based on a coionmetnc 
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assay able to discriminate between patients and healthy subjects and to early 
detect the relapse of the disease during follow-up (Sozzi, G. et aL, Cancer 
Research 61, 4675-4678, June 15, 2001). The colorimetric techniques (e.g. 
DNA Dipstick) for the quantitative evaluation of circulating DNA, however, 
5 are limited by a narrow linearity in the range 0.1-10 ng/ml, and by reduced 

sensitivity to lower values. In addition the test reading relies on subjective 

evaluation. 

A more recent paper (Hsueh-Wei Chang et al., Journal of the National 
Cancer Institute, Vol. 94, No. 22, November 20, 2002) suggested the analysis 
10 of DNA concentration in plasma samples and the analysis of SNPs for the 
detection of neoplastic disease, in particular ovarian cancer. DNA quantitation 
has been carried out by analysis of fluorescence intensity generated by the dye 
PicoGreen® linked to double-stranded DNA. The results show that, contrary 
to SNP analysis, the method used for measuring plasma DNA concentration is 
15 poorly sensitive and specific, and therefore is not suitable for the screening of 
population for neoplastic disease. 
SUMMARY OF THE INVENTION 

The present invention provides a method for the quantitative 
determination of circulating DNA in a plasma sample from a cancer patient, a 
20 subject with familiar cancer susceptibility or at risk of developing cancer, 

which comprises: 

1) extracting the DNA from the sample; 

2) adding to the DNA preparation: a) a mixture of oligonucleotide 
primers suitable for PCR amplification of a fragment of the human 

25 telomerase reverse transcriptase (hTERT) gene, and b) an 

oligonucleotide probe, having at least one quencher and one 
reporter fluorophore at the 3' and 5'ends, annealing to a sequence 
within the region delimited by the primers, 
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in suitable conditions for carrying out a PCR reaction, 

3) adding a heat-stable DNA polymerase with 5'-3' hexonuclease 
activity and amplificating the hTERT gene fragment; 

4) measuring the produced fluorescence. 

5 The relative amount of DNA present in the tested sample can easily be 

obtained by the detected fluorescence values by interpolation of a calibration 
curve obtained with known amounts of DNA. 

According to a preferred embodiment, primers and probes are designed 
so as to amplify the fragment 13059-13156 of the hTERT gene (GenBank 

10 accession n. AF 128893). 

The described procedure, known as real-time PCR, can be automatically 
carried out using the apparatus GeneAmp 5700 Detection System (Applied 
Biosystem), which provides accurate quantitative DNA amplification through 
continuous optical monitoring of the fluorogenic PCR reaction, and it can 
15 quantify up to 1 genome-equivalent, corresponding to 6 pg of genomic DNA. 

The quantitation method of circulating DNA according to the invention 
can be applied to the early diagnosis, prognosis or clinical monitoring of 
patients with different kinds of cancer, including colon-rectum, head-neck, 
liver and pancreas cancer, particularly lung cancer. The method of the 
20 invention can also be used for the determination of the risk or likelihood of 
developing cancer in healthy subjects exposed to environmental or life-style 
risk-factors such as, in case of lung cancer, cigarette smoking. An increase in 
plasma DNA of these subjects is predictive of increased risk and therefore 
should prompt to repeat the test on an independent plasma sample, or to more 
25 thorough and specific clinical investigations. 

DETAILED DESCRIPTION OF THE INVENTION 

A simple, accurate method in vitro has been found for the determination 
of the presence of a neoplastic disease or for the evaluation of the risk 



PCT/EP2003/014719 

WO 2004/057024 



4 



connected with development of cancer, by quantitation of plasma DNA using 
the real-time PCR technique. More precisely, the invention provides a method 
for the quantitation of circulating DNA by continuous optical monitoring of a 
fluorogenic PCR reaction carried out on a plasma sample, using as primers 
5 and probes oligonucleotides which couple with the human telomerase reverse 
transcriptase (hTERT) gene. The amplification rate of hTERT is used as an 
indication of the DNA total amount in the tested sample. 

The procedure has been validated in a large case-control study (200 
subjects) involving patients with early stage lung cancer, in which during 
10 follow-up plasma DNA levels were determined after cancer resection, and 
healthy, heavy smokers matched by age, sex and smoking habits. The results 
of the study show that 1) the method of the invention has high specificity and 
sensitivity, as evidenced by the AUC and ROC curves (global value: 0.94, 
interval 0.907-0.973), 2) in patients, including those with very early stage 
15 cancer (stage IA), the values of circulating plasma DNA are approx. 8 fold 
higher than those found in healthy subjects, 3) in patients monitored during 
cancer resection follow-up, the DNA levels rapidly decrease down to a value 
similar to that observed in healthy subjects; in contrast, an increase up to 20 
fold has been found in individuals with recurrent cancer or metastasis. 
20 Although the highest sum of sensitivity (90%), specificity (86%), 

positive predictive value (PPV-90%) and negative predictive value (NPV- 
90%) was obtained with a DNA concentration value of 9 ng/ml, the 
confidence intervals (CIs) around these diagnostic indicators overlap with 
those of adjacent concentrations. The selection of the optimal cutpoint will 
25 therefore have to acknowledge this variability. The value of 25 ng/ml is the 
only cut-point shown with sensitivity that does not overlap with that of other 
cut-points, although it shows a low sensitivity (46%, 95% CI, 36%, 56%). 

The magnitude of reported ORs (Odd Ratios) proves the strong 
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association between plasma DNA concentration and NSCLC risk, despite wide 
confidence limits. Similar OR values were never previously reported for any 
biological marker and could be of substantial benefit in clinical practice. 
Increased amounts of circulating plasma DNA were observed in samples from 
5 any stage and tumor size. This is particularly relevant for small lesions, whose 
systematic detection could help reducing lung cancer morbidity and mortality. 

One important aspect of the quantitative analysis provided by the 
invention is the possibility to follow longitudinal changes after cancer 
resection. The data available on 35 cancer patients showed a rapid decrease of 

10 circulating DNA values after lung resection. On the contrary, no decreasing or 
increasing levels of plasma DNA identified subjects with recurrence of their 
disease (24.7 ng/ml vs. 7.1 ng/ml in cancer- free patients, p=0.002), suggesting 
that quantification of plasma DNA might represent a novel approach to 
monitor surgical patients or assess treatment efficacy after chemoradiotherapy. 

15 To explore the possible modulation of free DNA release by smoking 

exposure, 20 never smokers over 55 yrs of age were analysed: their median 
DNA value was 0.61 ng/ml. The same analysis, carried out on the high-risk 
group from the case-control study (heavy smokers of age above 50, with 
exposure to smoke >20 pack/year) showed an average value of 4.6 ng/ml. 

20 These results indicate very low amounts of free circulating DNA in unexposed 
people. 

The results of the clinical studies show the effectiveness of the method 
for the quantitation of circulating DNA of the invention in the early diagnosis 
of lung cancer in high-risk subjects, in particular in heavy smokers. 
25 DESCRIPTION OF THE FIGURES 

Figure 1, Amplification plots of the real time quantitative PCR for hTERT in 
plasma DNA from NSCLC patients (A) and control samples (B). The relative 
amount of plasma DNA is much higher in patients samples (left plots) 
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compared with controls (right plots). 

Calculation of the amounts of plasma DNA is based on the cycle number, 
where fluorescence of each reaction passes the threshold (CT), which is set to 
the geometric phase of the amplification above the background. The X axis 
5 denotes the cycle number of a quantitative PCR reaction. The Y axis denotes 
the log of fluorescence intensity over the background (ARn). 
Figure 2. Box-plots of plasma DNA concentration in NSCLC cases and 
matched controls. The box is bounded below and above by the 25% and 75% 
percentiles, the median is the solid line in the box, the lower and upper error 
10 bars indicate 90% of values. Mini = minimum value detected, max = 
maximum value detected, sd = standard deviation 

Figure 3. ROC curves for the diagnosis of NSCLC using plasma DNA values. 
Curve and area under the curve (AUC) estimated using the logistic procedure 
in SAS (Cary, NC). 
15 MATERIALS AND METHODS 

Case and control series 

One-hundred consecutive cases of non small cell lung cancer (NSCLC), 
81 men and 19 women, non previously treated with chemo-radiotherapy and 
included in the European Institute of Oncology (IEO) tissue bank from 2000 

20 to 2001, were evaluated. One control was selected for each case, matched by 
sex, age and smoking habits. 

Mean age +- sd was 65.1 +- 8.9 years in cases and 64.1 +- 8.2 in 
controls, average smoking duration was 40.5 +- 10.9 years in cases and 
41.7 +- 9.5 in controls. The population included 7 case-control pairs of never 

25 smokers. The 93 heavy smoker controls were selected among the participants 
to the IEO early detection program, whose chest spiral CT proved to be 
negative. Non smoking controls were recruited from healthy blood donors at 
the immuno-hematology unit of the Institute. The number of former smokers 
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differed slightly between cases and controls (28 cases and 1 1 controls). 

One light and occasional smoker case (< 5 cigarettes per day) was 
matched to a never smoker. A mean within-pair difference of 1 year in age 
was statistically significant (p<0.02), suggesting the need to adjust for age in 
5 the analysis of plasma DNA as risk factor. 

Samples collection and DNA isolation 

A 7.5 ml sample of peripheral blood was EDTA collected from patients 
prior to surgery and from controls at the time of spiral CT examination, and 
stored at -140°. Plasma separation and DNA extraction were performed as 
10 previously reported (25). The DNA purified from 1 ml of plasma was eluted in 
a final volume of 50 \\X of water. Testing of plasma DNA was performed 
without knowledge of the case / control status. 

DNA Quantification in Plasma 

To quantify the circulating DNA in plasma we used a real-time 
15 quantitative PCR approach based on the 5' nucleotide method. This 
methodology is based on continuous monitoring of a progressive fluorogenic 
PCR by an optical system (28'29). The PCR system uses two amplification 
primers and an additional, amplicon-specific, and fluorogenic hybridization 
probe whose target sequence is located within the amplicon. 
20 The probe is labeled with two fluorescent dyes. One serves as a reporter 

on the S'-end (VIC™ dye). Its emission spectrum is quenched by a second 
fluorescent dye at the 3 '-end (TAMRA ). If amplification occurs, the 5' to 3' 
exonuclease activity of the AmpliTaq® DNA polymerase cleaves the reporter 
from the probe during the extension phase, thus releasing it from the quencher 
25 (30). The resulting increase in fluorescent emission of the reporter dye is 
monitored during the PCR process. 

Primers and probes were designed to specifically amplify the ubiquitous 
gene of interest, the human Telomerase Reverse Transcriptase (hTERT) single 
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copy gene mapped on 5p 15.3 3. The amplicon size of the hTERT gene was 98 
bp (position 13059-13156, of the GenBank as accession no AF128893). The 
sequences of the primers and of the probe were the following: primer forward: 
5'-GGC ACA CGT GGC TTT TCG-3'; primer reverse: 5'-GGT GAA CCT 
5 CGT AAG TTT ATG CAA-3'; probe: VIC5'- TCA GGA CGT CGA GTG 
GAC ACG GTG-3 ' TAMRA. 

Fluorogenic PCRs were carried out in a reaction volume of 50 |il on a 
GeneAmp® 5700 Sequence Detection System (Applied Biosystem, Foster 
City, CA). Fluorogenic probe and primers were custom synthesized by 

10 Applied Biosystems. Each PCR reaction mixture consisted of: 25 fil of 
TaqMan® Universal Master Mix Applied Biosystem; 0.67 jj.1 probe [15 mM]; 
0.45 jllI primer forward [10 mM]; 0.45 \xl primer reverse [10 mM]; 18.43 \xl 
sterile water. 5 jul of DNA solution was used in each real-time PCR reaction. 
Thermal cycling was initiated with a first denaturation step of 50°C for 2 min 

15 and then 95°C for 10 min. The thermal profile for the PCR was 95°C for 15 s 
and 60°C for 1 min. Data obtained during 50 cycles of amplification were 
analyzed. 

Amplifications were carried out in 96-well plates in a GeneAmp® 5700 
Sequence Detection System (Applied Biosystem). Each plate consisted of 

20 patient samples in triplicates and multiple water blanks as negative control. 
For construction of the calibration curve on each plate we used a standard 
TaqMan Control Human Genomic DNA at 10 ng/pl (Applied Biosystem) with 
appropriate serial dilutions at 50 ng, 5 ng, 2.5 ng, 0.5 ng, 250 pg, 50 pg and 10 
pg. Linear amplification down to the last dilution point representing 10 pg of 

25 target DNA was obtained in each experiment (Correlation coefficient 0.999- 
0.995; slope 3.25-3.35). 

All the data were analysed using the Sequence Detection System 
software of Applied Biosystem to interpolate the standard amplification curve 
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of DNA at known quantity with amplification CT of the unknown target 
sample thus obtaining the relative amount of DNA in the experimental sample. 

For the follow-up study, all the consecutive plasma samples for each 
patient were simultaneously analysed in the same real-time PCR experiment to 
5 allow comparative quantification of samples along the observation time. 

Pathological and immunohistochemical methods 

Clinico-pathological data were available for all cases. There were 58 
adenocarcinomas, 34 squamous cell carcinomas, 3 large cell carcinomas, 3 
pleomorphic carcinomas, and 2 adenosquamous carcinomas. According to the 

10 WHO classification of lung adenocarcinoma (31), 19 (32.6%) showed acinar 
growth pattern, 19 papillary, 17 (29.3%) solid and 3 (5.2%) bronchioloalveolar. 
According to the revised lung cancer staging system (32), tumor stage was 
pTl in 18%, pT2 in 55%, pT3 in 21% and pT4 in 6%; forty-seven per cent of 
patients were pNO, 20% pNl and 33% pN2 orN3. Stage I accounted for 34%, 

15 IIB 25%, IIIA 33%, and IIIB/IVB 8%. In three cases, the analysis was 
performed only on mediastinal nodes metastases. 

In every case, the occurrence of either tumor necrosis or lymphoid 
infiltrate was evaluated semi-quantitatively on a scale from absent to 2+ (1+ if 
<50%, and 2+ if >50% of the whole tumor). For immunohistochemical 

20 analysis, formalin-fixed and paraffin-embedded samples obtained at surgery 
were investigated for cell apoptosis-related (p53) and tumour growth (CD- 
117, Ki-67, and microvessel density) markers, according to previously refined 
methods (33). All cases were evaluated blindly without knowledge of the 
patients' identity, pathological diagnosis, clinical outcome or plasma DNA 

25 status. The percentage of p53-, EGFR-, CD117- and Ki-67-immunoreactive 
tumor cells was evaluated by scoring a minimum of 1,000 tumor cells in 
representative fields of immunostaining. Tumor neoangiogenesis was inferred 
by CD34 immunostaining of endothelial cells as previously described (34). 
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Statistical Methods 

The distribution of DNA values revealed a departure from normality 
that was mitigated using a logarithmic transformation. The log of the 
concentration was used for testing purposes, however untransformed values 
5 were used for reporting results. 

Odds ratios (OR) and corresponding 95% confidence intervals were 
calculated using conditional logistic regression in SAS (Cary, NC, USA) to 
assess plasma DNA as a risk factor for NSCLC. A receiver-operating 
characteristic curve (ROC) was built to evaluate the diagnostic performance of 
10 plasma DNA concentrations. Each unique DNA value was used as cut-point to 
calculate sensitivity and specificity values defining the curve and the area 
under the curve (AUC). Standard errors were estimated separately as 
described in van der Schouw (35) in order to provide a 95% confidence 
interval for the area. 

15 In cases, the possible association of DNA concentration with continuous 

clinical and pathologic parameters was explored calculating Pearson's or 
Spearman's correlation coefficients, depending on the distribution of the 
parameters (i.e. age and distribution of EGFR marker respectively). 
Comparison of median DNA plasma concentrations in patients with follow-up 

20 data was done using the Kruskal-Wallis test. 
RESULTS 

Quantitative analysis of circulating plasma DNA in cancer patients and 
controls 

Figure 1 shows amplification plots of fluorescence intensity against the 
25 PCR cycle from plasma samples of cancer patients and matched controls. Each 
plot corresponds to the initial target DNA quantity present in the sample. The 
amplification curves shifted to the right, representing reduced target DNA 
quantity, clearly discriminate controls from cancer patients. The distribution 
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of plasma DNA concentration in cases and matched controls describes two 
distinct populations of values despite some overlapping (Figure 2). Median 
concentration in cases (24.3 ng/mi) was almost eight times the value detected 
in controls (3.1 ng/ml). Very high concentrations were observed only in cases, 
5 whereas at the other end of the distribution there were few cases with very low 
concentrations of DNA (i.e. 0.5 ng/ml). A greater variability of circulating 
DNA was observed in cases than in controls (see Figure 2). 
Plasma DNA concentration as risk factor for NSCLC 
An elevated concentration of circulating plasma DNA was associated 
10 with a higher risk of NSCLC. Tertile stratification showed that the risk 
increased exponentially when comparing subjects with plasma concentrations 
in the second and third tertile to those in the first tertile using conditional 
logistic regression (up to 85-fold, Table 1). When analyzed as continuous, a 
unit increase in plasma DNA (ng/ml) was associated with a 21% increase in 
15 NSCLC risk (OR=1.21, 95% cl=l.ll, 1.31). Circulating DNA concentration 
was associated with age in cases (age coefficient when regressing the log of 
the plasma concentration=0.029, standard error=0.013, p=0.026). No association 
was observed between plasma DNA levels and smoking intensity or duration. 
Diagnostic performance of quantitative real-time PCR assay 
20 The area under the ROC curve shown in figure 3 was 0.94 (95% 

confidence interval 0.907-0.973) suggesting a strong discrimination power of 
the molecular assay. Shown in table 2 are few of the DNA concentration cut- 
points used to generate the curve with their sensitivity, specificity, positive 
predictive value and negative predictive value. The 95% confidence intervals 
25 around sensitivity estimates overlapped between the successive concentrations 
shown in table 2, except for the last two concentrations (20 and 25 ng/ml). 
Correlation of plasma DNA levels with clinico-pathological features 
Among the morphological and immunoistochemical markers analysed, a 
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significant association with EGFR expression as well as with microvessel 
density were observed (table 3). The proliferation marker Ki67 increased with 
increasing plasma DNA concentration, although the correlation was not 
statistically significant (p=0.15). Age was the only clinical parameter that was 
5 significantly associated with DNA release in plasma. No association between 
plasma DNA levels and necrosis, lymphoid infiltration or growth patterns 
were observed. 

Change in plasma DNA levels during follow-up 

In 35 out of the 100 cancer patients evaluated for the amounts of plasma 
10 DNA prior to surgery for Stage I-II tumors, a second plasma sample was 
obtained and analysed to monitor changes in DNA levels during clinical 
follow-up, 3 to 15 months after surgery (median elapsed time 8 months). The 
overall median DNA concentration in follow-up plasma samples was 8,4 
ng/ml, showing a clear trend versus reduction when compared to median 
15 baseline levels of 24.5 ng/ml (p<.0001). When comparing these patients 
according to their clinical status, median DNA concentration at follow-up was 
significantly lower in 30 disease-free subjects as compared to 5 cancer 
patients showing relapse (7.1 ng/ml versus 24.7 ng/ml; p=0.002). 

Table 1. Plasma DNA concentration as risk factor for NSCLC 

20 

Controls Controls Odds Ratio 95% CI 

N=100 N=100 Conditional * 

Distribution in tertiles of DNA in ng/ml f 

<=4 4 62 1% 

4.1-20 27 36 5.5 (1.9- 16.3) 

>20 69 2 85.5 (16.5-445) 

* adjusted for age 

f Obtained from the pooled distribution of plasma DNA values in cases and in 
controls. 

% Reference group 
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Table 2. Screening performance of plasma DNA concentration 



Cut-point Sensitivity 95% CIt Specificity 95% CIt PPV NPV 
(ng/mL)* 



4 


97 


91.5-99.4 


60 


49.7-69.7 


71 


95 


7 


92 


84.8-96.5 


77 


67.5-84.8 


80 


91 


10 


88 


80.0-93.6 


86 


77.6-92.1 


88 


88 


15 


78 


68.6-85.7 


95 


88.7-98.4 


94 


81 


20 


69 


58.9-77.9 


98 


93.0-99.8 


97 


76 


25 


79 


35.9-56.3 


99 


94.5-100 


98 


65 



PPV = positive predictive value, NPV = negative predictive value 

* Contrasting concentrations lower than the cutpoint vs concentrations equal 

to or higher than the specified value 

f Sensitivity and specificity estimates were treated as binomial parameteres to 

calculate 95% confidence intervals using Statxact (Cambridge, MA) 

Table 3. Correlation of plasma DNA levels with clinico-pathological parameters 



Log (DNA concentration) 




Age 


Smoking 
years 


LogMVD 


Ki67 


Egfr 


Coefficient* 


0.22 


0.04 


0.25 


0.15 


0.2f 


P Value 


0.026 


0.71 


0.016 


0.15 


0.044 


n% 


100 


93 


97 


97 


97 



* Pearson's correlation coefficients 
f Spearman's correlation coefficients 

% Sample size reduced to 93 controls of smokers (current or former) and 97 
controls with primary cancer samples. 



